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Abstract 

We define a normal form (called the canonical image) of an arbitrary 
measurable function of several variables with respect to a natural group 
of transformations; describe a new complete system of invariants of such 
a function (the system of joint distributions); and relate these notions to 
the matrix distribution, another invariant of measurable functions found 
earlier, which is a random matrix. Bibliography: 7 titles. 

1 Introduction 

In the first section of the paper, we set the problem and introduce basic no- 
tions. The main part is the second section, where we introduce the system of 
joint distributions and prove the main lemma on the bases generated by a pure 
function. Using these notions and results, we define a normal form of a measur- 
able function of two (or more) variables and introduce the so-called canonical 
image of a function. This is an interesting, if not completely explicit, model of 
functions, which is partly similar to a random process with a set of measures as 
the parameter set. This notion, which is apparently new, is related to a number 
of interesting questions, which presumably provide a new method of studying 
measurable functions of many variables. The third (sketchily written) section 
links the previous analysis to the notion of matrix distribution (which is a mea- 
sure on matrices, or a random matrix) - another invariant of functions, which 
can be called randomization. More exactly, this is a method of classification 
oj functions via randomization; it generalizes Gromov's idea, developed by the 
author, on the classification of metrics on measure spaces. The construction 
of a measurable function with a given matrix invariant, announced in [1], is 
based on the ergodic method, systems of joint distributions, and the canonical 
image considered in Sec. 2. The author hopes to develop these ideas in another 
publication. 
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1.1 Setting of the problem 

Recall that the classification of measurable functions of one variable defined 
on a Lebesgue space {X, ii) with a continuous measure fi and taking values in 
a standard Borel space A with respect to the group of all measure-preserving 
transformations of the argument is almost obvious. Namely, a measurable, a. e. 
one-to-one function f : X ^ A is equivalent to the identity function on the 
space A endowed with the measure where is the distribution of / (i.e., 
the image of the measure /j, under /). This is exactly the normal ("tautologi- 
cal" ) form of an arbitrary measurable one-to-one function of one variable. It is 
not difficult to write a normal form of an arbitrary (not necessarily one-to-one) 
function. This is done in [2] (see also |4]) using the theory of conditional mea- 
sures. To find this normal form, one should consider the partition of the space 
{X, n) into the level sets of the function / and regard (X, /x) as a fiber bundle 
over A. Assigning to /*^-a.e. value a € A oi f the type of the conditional mea- 
sure on the corresponding level set f~^{a), we turn (X,^) into a fiber bundle 
defined only in terms of A and the family of these conditional measures; denote 
it by A. Then the normal form of a general measurable function of one variable 
is the projection F : [A,]!) — > {A, f*iJ,). 

In this paper, we study a similar classification problem for functions of several 
variables, which is far more difficult. It links up with the paper [3], but here 
we sketch a new method of analyzing measurable functions and relate it to 
the existing approach. The method consists in considering the system of joint 
distributions of sections of a function of several variables as an invariant with 
respect to a natural equivalence. 

Consider the product {X x Y, fi x i') of two spaces with continuous measures 
and a measurable function / on this product taking values in a standard Borel 
space A. We will always consider continuous measures /i, and the value space 
A will always be an arbitrary standard Borel space (finite, countable, or con- 
tinual; the latter case is the most interesting). We may assume without loss of 
generality that A is the interval [0, 1]; we will use only the Borel structure on 
this space. 

Definition 1. Two functions f and f defined on the spaces {X x Y, fj, x i>) and 
(X' X Y' , fj,' X v'), respectively, and taking values in the standard Borel space A 
will he called equivalent if there exist measure-preserving {Tfi — fi' , Sv = v') 
invertible transformations T : X X' , S :Y ^ Y' such that 

f'{Tx,Sy) = f{x,y), 

or, equivalently, 

/V,y')-/(7^"V,5-V). 

There are various special cases and generalizations of this notion, which can 
be studied by the same method. In the definition above, the group with respect 
to which we classify functions is the product of the groups of measure-preserving 
automorphisms of each component: 

Aut(X, /i) X k\it{Y,v). 



2 



We mention two more possibilities. 

Definition 2. With the above notation, assume that the factors coincide as 
measure spaces: X = Y and X' = Y' . In this case, we can consider the 
equivalence of functions with respect to automorphisms of the form T x T ; this 
classification will be called the diagonal classification of measurable functions: 
functions f and f are equivalent in this sense if there exists an automorphism 
T of {X, fi) such that 

f'{Tx,Ty) = fix,y), 

or, equivalently, 

nx\y')=f{T-^x',T-^y'). 
In other words, here we consider the diagonal action of the group Aut(X, /i). 

The diagonal classification is natural, for example, in the case of functions 
symmetric with respect to permutations of arguments. In particular, this setting 
incorporates the important problem of classification of metrics as measurable 
functions on a metric measure space; this problem is studied in detail and solved 
in [HIS]. 

The second classification problem for functions of two variables is related to 
the group of skew products, or the "skew equivalence." It is as follows. 

Definition 3. For simplicity, assume that two functions /(•, •) and /'(•, •) are 
defined on the same space 

{X xY, fixi/). We say that they are skew- equivalent with respect to the first vari- 
able if there exists a "skew product" automorphism G of the space {X xY, fixu), 
i.e., an automorphism of the form G{x, y) — {Tx, Sxy) where T : X ^ X is an 
automorphism of the space {X,fi) and {Sx}xex is a family of automorphisms 
of the space (Y, v), that is measurable with respect to x € X and satisfies 

f'(Tx,Sxy) = f{x,y), 

or, equivalently, 

f'{x', y') ^ f{T^^x', S~^y'), x = T^^x' . 

One can easily see that the skew equivalence problem can be reduced to 
the first problem with one variable less and the value space A replaced with 
the space of probability measures on A (i.e., with the space meas(v4), or, more 
exactly, Meas(A); see below). Hence it does not involve any essentially new 
effects. 

In a similar way we may state classification problems for functions of more 
variables; we can also impose additional restrictions (symmetries, inequalities, 
etc.) on the values of functions, and, which is especially important, vary the 
groups of automorphisms with respect to which the equivalence is being con- 
sidered. The difficulty of a classification problem depends substantially on the 
difficulty of the corresponding group as a subgroup of the group of all automor- 
phisms. In this paper, we mainly consider the first equivalence with respect to 
the direct product of the groups of automorphisms of the variables. 
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In [1], the direct and diagonal equivalence problems were solved in terms 
of so-called matrix distributions, i.e., S'oo- invariant measures on the space of 
distance matrices. Below we give a new proof of this result, which follows from 
the main theorem of this paper on the system of joint distributions as a complete 
invariant. 

1.2 Basic notions 

Definition 4. A function /(•, •) of two variables defined on a space (X xY, fixv) 
is called pure if the partition ( respectively, ^2 ) of the space (X, fi) ( respectively, 
{Y,v)) into classes of points x G X (respectively, y d Y) for which f{x,-) = 
f{x',-) v-mod (respectively, f{-,y) — f{-,y') /i-niod Oj is the partition into 
points mod 0. 

The purety of a function of two variables is a counterpart of the one-to- 
oneness of a function of one variable. The general case of the classification 
problem can easily be reduced to the case of a pure function by considering the 
quotient space oi [X xY, ^ x v) obtained from the partition ^1 V ^2 (see [4]). 
The group-theoretic meaning of the notion of a pure function is as follows. 

Proposition 1. A measurable function of two variables is pure if and only 
if its stabilizer with respect to the action of the groups AutQ(X, /i) x Id and 
Id X AutQ(y, i^) is trivial (i.e., consists of the group identity); here A\xtQ{X, pL) 
(respectively, PMiqiYjv)) is the group of transformations of the space (X, /i) 
(respectively, {Y,v)) with a quasi- invariant measure, which acts in a natural 
way (by substitutions of variables) in the space of measurable functions. 

A stronger condition (which is more difficult to formulate in terms of sec- 
tions) is as follows. 

Definition 5. A function is called totally pure if its stabilizer with respect to 
the action of the group AutQ(X, /i) x A\itQ{Y,v) is trivial. 

As we will see, the total purety of a function means that it has no symmetries 
of the form of direct products of substitutions of variables. Using this definition 
as a model, one can formulate similar purety conditions for actions of other 
groups. 

For the diagonal classification, the purety of / means that its stabilizer with 
respect to the diagonal action of the group AutQ(X, /i) is trivial. Note that, 
according to a theorem proved in [5] , the stabilizer with respect to the diagonal 
action of any function that is pure in the sense of the previous definition is a 
subgroup of Autg {X, /i) compact in the weak topology. It is not difficult to 
formulate the notion of purety and its group-theoretic meaning for the skew 
equivalence. 
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2 Equivalence invariants 



2.1 Constructing the system of coherent distributions 

In what follows, we speak only about the main equivalence, leaving it to the 
reader to reformulate the results for other equivalences. 

We will define a system of characteristics that uniquely determines a pure 
measurable function of several variables up to the equivalence described above. 
In fact, this is the system of joint distributions, whose definition is similar to 
Kolmogorov's definition of a random process; however, its peculiarity is that 
the parameter set ("time") is a measure space, with all that this entails. Never- 
theless, for this "pseudoproccss" one can also state an analog of Kolmogorov's 
theorem on the existence of a measure on "trajectories" ; moreover, since the 
parameter set is a Lebesgue space, the corresponding process is separable in 
spite of the fact that the base is continual. Since we will not use this notion in 
what follows, we provide almost no details, but we find it important. One of 
the questions is when this definition gives a function of several variables. 

First we prepare necessary spaces. Recall that we endow the space 

A" = A X • ■ • X 4 , n=l,2,..., 
n factors 

with the Borel structure of the product of spaces (generated by the products 
of Borel sets on the factors), and by meas(A") denote the simplex of all Borel 

probability measures on the Borel space n=l,2 *** We define Borel 

homomorphisms VJ^ and Vy of the spaces X" and Y", n = 1, 2 . . . , to the space 
meas(^") as follows. For n = 1, we set Vx{x) = al € meas(A) (respectively, 
Vyiy) = G meas(A)), where al (respectively, a^) is the distribution of the 
section f{x, •) regarded as a function of one variable y £ Y with respect to 
the measure v (respectively, the section f{-,y) regarded as a function of the 
variable x G X with respect to the measure /it). The fact that these maps are 
Borel follows from the measurability of the function. 

Let n > 1. To almost every, with respect to the measure /u" = /U x • ■ • x /U, 



collection of points (a;i, . . . , Xn) G X" (respectively, (yi, . . . y„) S Y"), we asso- 
ciate the measure al'^ on A" that is the joint distribution of the family of 
sections {f{xi, •), f{x2, •),•••, f{xn, •)} regarded as an ^"-valued vector function 
on (Y,!/); in the same way we define the distributions ciy^^...,y^- Denote these 
families by A;^ = {al_^ {xi} E X", n = 1, 2, . . . } (respectively, Ay(/) for 

the second variables); obviously, they are coherent in the natural sense: the 
projection of the measure on A" along the ith coordinate coincides 

with the measure at ^ ^ on A"^^. Each of the two families A, regarded 
as a collection of measures on A", n ~ 1,2,..., will be called the system of joint 
distihutions (SJD) of sections of the correspondent variable; both families to- 
gether will be called the system of joint distributions of sections. In the similar 
definition for functions of more than two variables, one should consider sections 
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of codimension one (rather than one-dimensional sections). 

Remark 1. The joint distribution of sections is, obviously, well defined with 
respect to modO, i.e., the family A depends only on the class of functions coin- 
ciding modO. 

Remark 2. It is obvioiis from the definition that the SJD is invariant under the 
equivalence introduced above; more exactly, if two functions (defined on different 
spaces {X x Y,fj,i x /i2) and {X' x Y', 

Hi X iJi'2)) arc equivalent and this equivalence is implemented by a measure- 
preserving transformation TxS:XxY^X'xY', then the same transfor- 
mation sends the SJD of the first function to the SJD of the second function. 

Fix a Borel set B in A" and consider the following measurable subsets in X 

and Y: 

= {x\ 3{yi,y2,...,yn) G Y"" : (/(x, yi), /(a;, 1/2), • • • , /(a;,y„)) e B}, 
Cl = {y\ 3{xi,X2,...,Xn) G X" : {f{xi,y),f{x2,y),...,fixn,y)) G B}. 

Lemma 1 (on bases). Let f be a pure function of two variables. The family of 
sets {C§ } where B ranges over a countable basis of Borel subsets in A" and n 
goes from one to infinity is a countable basis of the a-algebra of measurable sets 
in {X, fi); the same is true for the family of subsets {C^} in {Y, u). 

Proof. We must prove that the cr-algebra spanned by the sets {C§ } separates 

points mod in the space {X, /i) . But this is equivalent to the following asser- 
tion. Consider the equivalence relation defined as x x' 4^ f{x,yi) = f{x',yi), 
i = 1, . . . , n, for almost all collections ...,?/„ G F with respect to the measure 
i^" and all n = 1, 2, . . . . By the ergodic theorem (more exactly, by the strong 
law of large numbers), this means that f{x, ) = f{x',-) a.e.; by the purety 
of /, the latter relation determines the partition of {X, fi) into points mod 0. 
Therefore, the family {Cg } generates the full cr-algebra of measurable sets in 
the space (X, /x), and the product {C^} x {C^,} generates the full cr-algebra 
of measurable sets in the space {X x Y, fj, x v) . By construction, the measures 
of the sets C§ and Cg are determined by the SJD of /; hence a measurable 
function / uniquely determines the measure by these data; in turn, it is also 
uniquely determined by the family of distributions of sections. In other words, 
if we have two measurable pure functions /, /' on the space {X xYjfixv) with 
the same SJD, then the functions coincide mod 0. □ 

2.2 Completeness of the system of invariants 

The next theorem states that the SJD is a complete system of invariants of a 
pure function of two variables. 

Theorem 1 (Uniqueness). A pure function is uniquely determined mod by 
its SJD in the following sense. Assume that two pure functions f and f defined 
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on spaces {X xY, u) and {X' x Y', /j.' x u'), respectively, have the same SJD, 

M(Cf (/)) = (/')), = ^'{Cl'if')) 

for all basis Borel sets B c and all n = 1,2, Then the functions f and 

f are equivalent. 

In particular, if two functions are defined on the same space and their SJD 
coincide, then the functions coincide mod 0. 

Proof By the lemma, the SJD of each of the pure functions / and /' is a basis 
in the (T-algobra of the respective space. Then, as mentioned at the end of 
the previous proof, the correspondence between the sets C§ (/) and Cq (/') 
determines an isomorphism between the spaces {X,fi) and {X'.fj') that sends 
the measure /x to /i'. A similar assertion holds for the spaces (Y, v) and {Y' , v'), 
and the product of these isomorphisms is an isomorphism between the spaces 
{XxY,iixv) and {X' xY' , fi' x v') that provides an equivalence of the functions 
/ and /'. □ 

Remark 3. Writing an explicit formula for the values of a function / in terms 
of its SJD in the general case is a difficult task. For an illustration, we give a 
concrete example of such a problem. Assume that we are given a metric mea- 
sure; space: find the distance between given two points (i.e., recover the metric) 
knowing all joint distributions of the distances to any given finite collection of 
points. 



2.3 On systems of joint distributions and pseudoprocesses 

In fact, one can describe exactly to what extent the SJD with respect to one 
variable determines the second SJD; namely, given the SJD with respect to one 
variable, a function can be recovered up to one transformation. 

Theorem 2. Two measurable functions f\ and /2 defined on a space {X x 
Y,nx v) that have the same SJD in one (say, the first) variable are related by 
the following formula: 

f2{x,y) = fi{x,Ty), 

where T -.Y ^Y is a measure-preserving invertible transformation of the space 
{Y,u). 

Proof. Consider the limits of joint distributions in the variable x. For fixed 

Xi,X2, . ■ . ,Xn, the corresponding joint distribution is a measure Q:xi,...,xn on 
A", i.e., on finite ordered collections of points from A. But as n — >■ oo, we 
obtain a limit measure that is concentrated on measures (in general, continuous) 
on A. To see this, it suffices to regard axi,....x„ &s measures on the empirical 
distributions at the points Xi, i = 1, . . . ,n; then their limits will be distributions 
on A. But the family of these distributions determines the distributions of the 
functions {/(■,?/)}; more exactly, this is a measure on the set of distributions of 
A- valued functions in x; we lack only a correspondence between points y ^ Y 
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and these distributions. Any two such correspondences are sent to each other 
by a measure-preserving transformation. Thus the SJD in the first variable x 
determines a function / of two variables up to a transformation of the second 
variable, i.e., a measure-preserving transformation of the space iY, v). □ 

In fact, the SJD may be regarded in the framework of the following general 
definition, which does not involve any function. 

Definition 6. Consider a standard measure space {X, n) with a, continuous mea- 
sure and a standard Borel space A. Denote {X, n)" — (X", /z"), n = 1, 2, . . . . A 
random pseudoprocess is a coherent system of measurable maps (more exactly, 
modO classes of maps): 

Vn : {X,nr ^A^, n = l,2,..., 

Vn{xi,X2,...,Xn) = Q;(a:i,X2,...,x„) S meas(A"), 

where meas(-) is the simplex of probability measures on the corresponding Borel 
space; the coherence condition means that 

E^^a'^{xi,...,Xn) = a"~'^{xi,...,Xi,...,Xn), n> 1. 

In particular, given a measurable function of two variables, above we have 
defined such a random pseudoprocess (in each variable) . In much the same way 
as a coherent system of joint distributions determines, according to Kolmogorov, 
a measure on the space of trajectories, here we have defined a pseudoprocess as 
a measure, but not on the set of trajectories, but, most likely, on measurable 
A- valued functions of points of the parameter set. Quite possibly, this notion 
appeared earlier as a process on a parameter set that is a Lebesgue space. Note 
that in spite of the continuality of the parameter set, the cr-algebra on which the 
corresponding law is naturally defined is separable, by the measurability of the 
maps Vn- The properties of pseudopro cesses and the question about whether a 
given pseudoprocess determines a function of two variables, or a polymorphism, 
or a more general object, will be considered elsewhere. Here we confine ourselves 
to a remark anticipating the constructions of the next section. The canonical 
image of a function of two variables, which is defined below, is essentially a 
realization of this function as a pseudoprocess, or a random field, over a measure 
space; as such a space, one can take the set of all (one-dimensional) distributions 
of sections of the function in one of the variables, and the construction requires 
only knowing all joint distributions of sections of the function. 

2.4 Description of the model 

Now we describe a normal form for the equivalence under consideration, which 
will be called the canonical image of a pure measurable function of two variables 
taking values in a Borel space A. 

First we construct a product space on which the normal form will be defined. 
This is the square of the space Meas(A), which we are going to define now. 
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Consider the simplex nieas(A) of all Borel probability measures on the Borel 
space A endowed with the weak topology determined by a countable basis of 
the a-algebra of Borel sets. 

The space Meas(A) is the (topological) direct product of the simplex meas(A) 

oo 

and the compact space S'=[0, 1]U (J the union of an interval and 

a countable set of points. It is convenient to regard Mcas(A) as the trivial 
fiber bundle over meas(A) with fiber S and denote by tt the natural projection 
Meas(j4) — >■ meas(A). The space Meas(A) is compact in the natural topology. It 
can be interpreted as the space of Borel probability measures on A with multi- 
plicities, or as the space of marked measures, the fibers being the sets of possible 
marks. 

Now we describe the class of measures we will consider on this space. 

Definition 7. A Borel probability measure on Mcas(A) will be called suitable 
if it has the following form: its projection to meas(A) is an arbitrary Borel 
probability measure on the simplex meas(A) (i.e., a "measure on measures"), 
and the conditional m,easures on fibers S turn them, into general Lebesgue spaces, 
i.e., the measures on the intervals are continuous Lebesgue measures, and the 
measures of the atoms are nonincreasing in n. 

Note that a suitable measure is uniquely defined by its projection to meas(A) 
and the types of the conditional measures on the fibers. It is of importance that 
a suitable measure admits a group of automorphisms preserving the measures 
on the fibers; this group may be trivial for some fibers or for all fibers (the 
latter is the case if the given distribution appears once as the distribution of a 
section). This group will be called the fiber group of a suitable measure; if a 
suitable measure has fibers with continuous components, then the fiber group 
is nontrivial. 

Definitions. A measurable function F defined on the space Meas{A)xMeas{A) 

endowed with the product ^ x v of suitable measures and taking values in the 
Borel space A satisfies the tautology condition (or is called tautological) if the 
distributions of sections of the function y i— ^ F(x,y) (respectively, x M> F{x, y) ) 
for almost all points x G Meas(j4) (respectively, y G Moas(A)/. regarded as 
functions on Meas(A) of the other variable, coincide with the measures 7r(a;) G 
meas(j4) (respectively, ir^y) G meas(A)^ on A. 

2.5 The canonical image of a function 

Using the systems of joint distributions, we present a procedure for successively 
refining images of a function so that they converge to its canonical image. 

Theorem 3. For every pure measurable function f defined on an arbitrary 
product of two Lebesgue spaces with continuous measures and taking values in a 
Borel space A there exist unique suitable measures fif and Vf on Mcas(A) and a 
unique measurable, pure, tautological function Can (/) on the space (Meas(A) x 
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yieas{A), fif X Vf) such that f is equivalent to Can/. The latter function will 
be called the canonical image of f- The measures jif and Vf are invariants of 
the class of equivalent functions. 

Thus tautological functions provide a normal form for pure functions; in 
particular, the theorem says that equivalent pure functions have the same, up 
to an isomorphism of fibers (see below), canonical image, and nonequivalent 
functions have different canonical images. Another construction of this model 

of functions will bo described in the last section. It is based on another approach 
to equivalence invariants and differs from the construction in the proof of this 
theorem in that it automatically invokes the space Meas and marking; more 
exactly, the arbitrariness caused by automorphisms of fibers is replaced with 
the arbitrariness in the choice of the initial matrix involved in the construction 
of that model. 

Proof. Let / be an A- valued measurable pure function on {X x Y, ^ x u). We 
will denote by f^\x{y) = f{x,y) (respectively, f^\y{x) = f{x,y)) the function 
obtained from / by fixing the first (second) variable. Given a function g on 
a measure space with a measure 7, the notation dist((7) = (7,7 stands for the 
g-image of 7 on the value space, or the distribution of g. 

Consider the maps Lj^ : X ^ meas(j4) and : F — ;> meas(A) defined by 
the formulas Li{x) = dist(/-^ |a;), ^2(2/) = dist(/^|y); that is, the image of a 
point X (respectively, y) is the distribution of the function /^|^ (respectively, 
/^ly) regarded as a measure on A (i.e., an element of meas(A)). 

Each of these maps gives rise to the quotient space of {X,iJ,) (or {¥,1/)) 
constructed from the partition of this space into the classes of points for which 
the corresponding distributions coincide. These partitions are invariant, in the 
sense that every equivalence preserves them. 

But it obviously follows from the definition of the space Meas that the maps 
Li,L2 can be lifted (in a way that is no longer invariant) to isomorphisms of 

spaces Li : X ^ Meas(A), L2 : Y Meas(A); that is, and Lj are isomor- 
phisms mod 0, which means that the images of different points x,x' (respec- 
tively, y,y') with the same distributions dist (/^|^.) = dist (/^|^,) (respectively, 
dist (/2| ) = dist (/2| ,)) in Meas will be different mod 0. 

This condition can obviously be satisfied, since Me&s{A) is a fiber bundle over 
meas(^) with fibers that can be turned into an arbitratry Lebesgue space, but 
the isomorphism is defined only up to fiberwise automorphisms. The measure 
/z X 1/ is transferred to Meas{A) x Meas(A), and its projection to meas(y4.) is an 
invariant of the equivalence. 

Fix a choice of isomorphisms and denote (I/j)*/x = fif and (Lj)*z^ = Uf. 
Thus we have constructed an isomorphism 

Lj X Lj : {X xY,ij,x ly) ^ (Meas(A) x Meas(^),/U/ x i^f). 

Now consider the classes of points in X and in Y with the same distributions 
dist(/^|a;) = dist(/^|a;') (respectively, dist{f^\y) = dist(/2|yO) mod 0. With 
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each such class C wc can do the following. Divide its square C x C into classes 
such that inside each class the joint distribution corresponding to the coordinates 
ci and C2 is the same. Thus we can refine our isomorphisms on the classes by 
requiring that the classes should be preserved. Then we do the same with the 
triple joint distributions, etc. It follows from Theorem 1 on the completeness of 
the SJD that (for a pure function) the product of all such partitions converges 
to the partition into points. Hence this process yields unique isomorphisms Li : 
X — > Meas(A) and L2 :Y Meas(A), which will also transfer the measures fj,, 
V and the function / to the image. Denote these images by /x/, i//. Thus we have 
obtained an invariantly defined measure space (Meas(^) x Meas(^),/i/ x vf) 
and a function on this space denoted by Can (/) and called the canonical image 
of /. The function Can (/) is tautological, as follows from the first step of the 
construction. The uniqueness follows from the same completeness theorem. 

Applying our construction to the spaces X = Y = Meas(A) and suitable 
measures \i and v on Meas(A), and taking an arbitrary pure tautological func- 
tion on (Meas(^) x Meas(7l),/i x v) as F, one can easily see that the functions 
L}^ and can be assumed to be identity functions. Hence every pure tauto- 
logical function on the space {Meas{A) x Meas{A), fi x u), where n and v are 
suitable measures, is a canonical image (of itself ). □ 

Thus the class of tautological functions on the space Mcas(^) x Mcas(yl) 
with some suitable measures /x, v is the class of all possible normal forms of 
pure functions of two variables: every pure function is equivalent to exactly one 
function from this class. 

Let us make several remarks concerning the canonical image. 

1. The final space (Meas(A)xMeas(A),/u/xz^/) and the function / are indeed 
canonical, since their constructions does not involve objects noninvariant with 
respect to the equivalence. Actually, the fact that the image is canonical follows 
directly from the purety of the function, since the stabilizer in each factor is 
trivial, so that the model is uniquely defined. 

2. Consider the simplest special case. Assume that the function is superpure, 
i.e., the distributions of the sections f{x, •) (respectively, /(•,?/)) for almost all 
points X £ X (respectively, y &Y) are distinct. Then the isomorphism between 
(X, /It) (respectively, {Y,n)) and meas(^) that sends a point x (respectively, y) 
to the measure on A that is the distribution of the function /(x, •) (respectively, 
f{-,y)) is the required one. In this basis can be constructed only from 
one-dimensional distributions. To clarify the general case, one may say that 
multidimensional distributions are needed in the definition of the isomorphism 
only because they allow one to split distributions having multiplicities in the 
SJD. However, having constructed an isomorphism, we must give an explicit 
description of the function in terms of the canonical image, and in all cases, 
including the simplest one considered above, this is done via the SJD. It would be 
interesting to classify measurable functions according to the minimal collection 
of joint distributions needed to recover it. 
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3. The space ineas(^) x ineas(yl) is a topological space (the product of sim- 
plices), as mentioned above. But the space Meas{A) x Meas(A) is also a compact 
space, since Meas = meas x S. One can define a (formally) different topology 
using two-dimensional, rather than only "one-dimensional," distributions. For 
this it suffices to define a neighborhood of a given point as the set of points such 
that the two-dimensional joint distribution corresponding to these two points is 
concentrated, in some sense, near the diagonal. 

4. If the value space A is endowed with a topology, then we can define 
the notion of the measurable continuity of a function, or, more exactly, of its 
canonical image. This continuity means that if the two-dimensional distribu- 
tions corresponding to two points are close, then the values of the function at 
these points are also close. 

Example. Consider a metric d on a Borel space A and define a metric on 
Meas(A) = X hy setting dr{x,y) = Ea^ ^ r{u,v), where u,v ^ A and the 
expectation is with respect to the joint distribution Ux^^x^- In & similar way we 
define a metric on F = Meas, and then ou X xY (the sum of the metrics on the 
coordinates). We say that a function is measurably continuous if it is continuous 
in this metric. This can be briefly formulated as follows: if the mean values of 
the functions of one variable obtained by fixing the second variable are close, 
then the values of the functions at the given point are close. 

Presumably, an admissible metric regarded as a function of two variables on 
a measure space is measurably continuous. 

Conjecture 1. A symmetric measurably continuous function of two variables 
on the space {X x X, n x fi) has a well-defined restriction to the diagonal. More 
generally, for every measurable partition, every measurably continuous function 
of two variables has a well-defined restriction to almost every element of the 
partition. 

3 Relation with random matrices: classification 
via randomization 

3.1 The recovery theorem 

Recall the main definitions and results of [4 . As above, let / be a pure function 
of two variables {x, y) defined on a space X xY with a product measure fi x v 
and taking values in a Borel space A. Consider the infinite product {X°° x 
Y°° , X with the Bernoulli measures and define a map 

Ff : (X°° X r°°) ^ Afoo(^), 

where Moo (A) is the space of ^-valued infinite matrices, by the formula 

F{{x..}T,{yj}T) = {fi^.,yj)m=i- 
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The map F is well defined with respect to mod and sends x v°° to a 
measure Df, which was called the matrix distribution of /. Obviously, Z?/ is a 
Borel probability measure on Mao{A) that is invariant under the group of all 
permutations of rows and columns of the matrix and ergodic with respect to 
this group. This follows from known facts on Bernoulli measures. 

Theorem 4 (4 ). The measure Df is a complete invariant of the Junction f 
with respect to the equivalence; in other words, two pure functions f and /' are 
equivalent if and only if the measures Df and Dfi coincide. 

Note also that the map F is equivariant with respect to the square of the 
symmetric group S^x Sn; in the space Moo{A), this group acts by permutations 
of rows and columns. The following simple assertion holds. 

Proposition 2. The map F is an isomorphism if and only if the function is 
totally pure (see the definition in Sec. 1), i.e., its stabilizer in the product of 
groups Autg {X, fi) x Autg ( Y, v) is trivial. 

In this case, the map F yields an isomorphism between the (Bernoulli) ac- 
tions of the square of the symmetric group in these spaces. In the case of a 
pure function /, the kernel of F consists of the orbits of the symmetry group of 
/, and, as was observed (see [6]), the symmetry group is compact, so that the 
partition into orbits is measurable. Moreover, one can construct the quotient 
space of {X x Y, fi x v) under the partition into orbits, but it will no longer 
have the structure of a direct product. Curiously enough, in spite of the fact 
that for pure, but not totally pure, functions, Ff is not an isomorphism of the 
spaces {X°° x Y°°,n°° x v°°) and {Moo{A),Df), nevertheless, as one can see 
from the proof of the recovery theorem and the lemma on bases, a basis of the 
space {X xY, fix v) can be recovered from the basis of the image {Moc,{A), Df). 

The construction below gives, in particular, another proof of the recovery 
theorem; it uses Theorem 1 and clarifies the ideas suggested in [4]. 

With obvious modifications, we can also state a similar theorem on the 
diagonal equivalence. In this case, we should take one sequence {xi}"^ of in- 
dependent random variables distributed on X — Y according to the measure 
jjL = v. If {X, jj) is a metric measure space and / is a metric regarded as a 
measurable function on (X x X,fi x fi) (which is obviously pure), this is the 
theorem saying that the corresponding measure on the space of metrics on the 
set of positive integers is a complete invariant of metric triples (metric measure 
spaces). This was proved in [I] in a somewhat different form, and then in [5], 
by a simpler (ergodic) method, in this form. 

Thus we have two types of complete invariants of the equivalence of arbi- 
trary pure functions of two variables, and thus two methods of classification 
of measurable functions. The first one is provided by the matrix distribution, 
and it can be called the classification via randomization, since the invariant 
is a randomized net of values of the function. Although the definition of this 
invariant is very simple, it does not provide a model of the function, i.e., a 
method of recovering the function from this invariant, a random matrix. Such a 
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recovery procedure will be described below. The second method, considered in 
this paper, is more cumbersome; it is the classification via the system of joint 
distributions, and it allows one to describe a normal form (the canonical image 
of a function) , which is a model of the function. 

In the next section, we will combine both methods; namely, we will show how 
one can construct the canonical image from the matrix distribution, i.e., give 
an explicit construction of a function with a given invariant. The construction 
relies on the ergodic method [3] ; more exactly, we systematically use the ergodic 
theorem for an action of the symmetric group, or, more specifically, successively 
construct empirical distributions. Another version of constructing a realization 
of a measurable function from its matrix distribution was considered in the 
unpublished manuscript [?]■ In terms of the construction below, it uses only 
the first averaging process, and hence is also less cumbersome, but ignores the 
system of joint distributions. 

3.2 Constructing a function from its matrix distribution 

Let Dhe a given Borel probability measure on the set of infinite matrices Mn(A), 
which is the matrix distribution of a measurable function taking values in a Borel 
space A and defined on a space {X xY,fixiy); we will not use this space in any 
way. The construction below uses only the fact that the measure D is invariant 
and ergodic with respect to the action of the group x (the square of 
the symmetric group) by permutations of rows and columns. The construction 
proceeds as follows (this is just an application of the ergodic method). We 
choose a "typical" matrix with respect to the measure D (i.e., almost every 
matrix with respect to this measure) and construct a space, a measure on this 
space, and a required function. 

Only then we use the fact that D is the matrix distribution of a (totally 
pure) function and give a direct description of measures D (called simple in [4]) 
that can be matrix distributions. In this case, we will see that D is exactly 
the matrix distribution of the constructed function (and hence the latter is 
equivalent to the original function). Thus we will construct the canonical image 
of a function with a given matrix distribution, and the relation to the previous 
section is that we construct the SJD of the function. The construction proceeds 
in several steps. 

1. Denote the matrix by i? = {^i,j}i°j=i- By the invariance of the measure D 
with respect to the group of permutations of columns, we can apply the ergodic 
theorem to the action of the group S'n, considering the "empirical distribution 
of rows" : 

^™ ~r Yl XBi{r^,g-^l)}) = rriBiB). 

n— >oo 77,1 " ^ ' 

Here i? is a cylinder set in A°° and xb is its characteristic function; the formula 
counts how frequently the first column falls into B when the matrix is being 
permuted. Thus we obtain a probability measure on the space AN; of course, 
it depends on the matrix R. 
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2. Using the invariancc of D with respect to the group of permutations of 
rows, we find the "empirical distribution of measures." In more detail, we lift 
the permutational action of the group on rows from the space to the sim- 
plex meas(j4^) of measures on this space, consider the orbit of the constructed 
measure mu on with respect to this action, and find the empirical distribu- 
tion of measures on this orbit. For this we should consider a basis in the space 
of Borel sets on the simplex of measures. Let B he a. Borel set in the space of 
measures. Then the empirical measure of this set (of measures on A^) IS given 
by the formula 



In this formula, the action of an element g G Sn is applied to the measure niR. 
The limit exists by the same reasons for Z)-almost all matrices R; in terms of 
R, this is a double limit, and its existence follows from the invariance of D 
with respect to the second (row) component of the group 5^ x 5^. We have 
constructed a measure on meas(yl^) (i.e., a measure on measures on A^) induced 
by the measure rriR and, indirectly, by the matrix R. Denote it by M^. 

3. Now wc do the same in the other order. First, using the same matrix R, 
we construct the empirical measure on the columns (using the invariance 
under the group of permutations of rows); this is again a measure on ^4°°. 

4. Then, as at Step 2, we take the average of over the columns to obtain 
a measure on meas(A^), i.e., a "measure on measures" similar to Mr. 

5. The next important assertion is that each of the measures M/j and M^, 
which were initially defined as measures on meas(A°°), induces a pair of mea- 
sures on MeasA" for all n = 1,2,..., namely, the finite-dimensional (with 
respect to meas(A°°)) empirical distributions on measures on A"; we denote 
them by and ly'^. For n = 1, they play exactly the same role as the measures 
fXf and i/f on MeasA, which were introduced in Sec. 2 in the definition of the 
canonical image of a function. Using these measures, we obtain the SJD of the 
future function and, consequently, the function itself. This is the transition to 
the canonical image. We do not describe what systems of joint distributions 
determine measurable functions. In the previous section, we moved in the op- 
posite direction and constructed the SJD from a given function. One can show 
that the measures Mji and also determine the function, but we do not use 
this fact below, since the existence of the function is already guaranteed and 
the problem is how to realize it. Denote this function on MeasA x MeasA by 
fa; it is the goal of our construction. As yet, we have used only the invariance 
of the measure D with respect to the square of the symmetric group. 

Now we state the main results on this function and on the whole construction. 
First we define a class of measures on the space of matrices specified by a 
condition called the simplicity of D. 

Definition 9. A measure D on the space of matrices Mfi{A) that is invariant 
and ergodic with respect to the square of the symmetric group is called simple if 




15 



the procedure described above that with a matrix R G {Aifi{A),D) associates the 
pair of measures 

{mR,m^) e Meas(A°°) x Meas(yl°°) 
is an isomorphism of measure spaces 

{Mn{A),D) ^ (Meas(A°") x Meas{A°°),MR x M^). 

The difference between pure and totally pure functions in the context of 
matrix distributions was observed by U. Habock in 2005; it is not mentioned 
in the original paper [3] , which caused the appearance in that paper of a false 
assertion that the matrix distributions of pure functions are simple measures; it 
was corrected in [^. This difference forces us to define not only simple measures, 
but also semisimple measures, which are the matrix distributions of simple func- 
tions. A direct description of semisimple measures in the same terms as in the 
case of simple measures is hardly possible, the reason being that taking the 
quotient with respect to the compact subgroup of symmetries of a function vio- 
lates the structure of a direct product, so that the described procedure yields a 
function defined on a more complicated object, since its variables are no longer 
independent. But the case of simple measures and, accordingly, of totally pure 
functions, is exhausted as follows. 

Theorem 5. 1. // the measure D is ergodic, then the measures Mr and 
on meas(A°°) and, consequently, the measures ^r,vr and the function fR con- 
structed above coincide for D-almost all matrices R. If D = Df is the matrix 
distribution of a totally pure function f , then the constructed function fR is 
equivalent to f, and its matrix distribution coincides with D. Thus the canoni- 
cal image of fR is a realization of the function with a given matrix distribution. 

2. The class of simple measures coincides with the class of matrix distribu- 
tions Df of totally pure measurable functions f. 

Proof. The first assertion in Claim 1 follows immediately from the ergodicity of 
the action of the group Sjq x Sjq. The remaining part of Claim 1 is reduced to 
more detailed comments on the construction, which is a proof. Comparing it 
with the construction of the canonical image shows that the required function 
is constructed via the SJD, as we have already mentioned. 

Now we proceed to Claim 2. Assume that we have a totally pure func- 
tion and we have chosen a typical realization of a sequence of i.i.d. variables 
Xi,X2, ■ . ■ ',yi,y2, ■ ■ ■ ■ Consider the matrix R = {f(xi, yj)} and keep track of 
the procedure described in Sec. 3.2. The first step, finding the measure niR, is 
equivalent to computing the joint distributions of the countably many functions 
f{xi, •), f{x2, •)>•■• regarded as functions of the second variable. The measure 
niR on A°° is exactly the joint distribution of these functions. The second step, 
computing the empirical measure from {2;^}^, determines the measure Mr (a 
measure on measures), but it is nothing else than a measure on the distribu- 
tions of the functions f{-,y)', more exactly, this is the image of the measure 
(Y, v) under the map y distx[f{-,y)], where dist^; stands for the distribution 



16 



of a function of x with a fixed y. The second series of operations with the roles 
of rows and columns interchanged is interpreted in the same way. Finally, as 
we have already observed, in the case of a totally pure function the map Ff 
is an isomorphism, so that the map from the space of matrices endowed with 
the measure Df to the space of empirical measures in the definition of a simple 
measure is also an isomorphism. 

The converse obviously follows from the construction. Namely, consider a 
simple measure D on M^{A); by the isomorphism of spaces in the definition of 
a simple measure, we do not lose information when constructing the function 
fn = /i E^nd the obtained function is totally pure, since the map _Fy^ is also an 
isomorphism. To see that the matrix distribution of this function is the measure 
D, one can look through the construction of /r from the SJD: in fact, the finite- 
dimensional restrictions of the measures Mfi and provide an approximation 
of the matrix distribution by finite-dimensional distributions. □ 

One can easily see that the described construction is an exact reproduction of 
the procedure describing the construction of the canonical image of a function; 
it is even somewhat more formalized, since here the space Meas (accounting for 
the multiplicities of distributions) appears automatically. Thus, given a matrix 
R, we have constructed the SJD, which, according to the theorem on bases, 
allows one to recover the hmction and hence the matrix distribution. 
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